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Abstract 

A fc-uniform, d-regular instance of Exact Cover is a family of m sets F„ : d,k = {Sj C {1,... ,n}}, 
where each subset has size k and each 1 < * < n is contained in d of the Sj. It is satisfiable if there 
is a subset T C {1, ... ,n} such that | T fl £.,■[ = 1 for all j. Alternately, we can consider it a d-regular 
instance of Positive 1-in-A; SAT, i.e., a Boolean formula with m clauses and n variables where each 
clause contains k variables and demands that exactly one of them is true. We determine the satisfiability 
threshold for random instances of this type with k > 2. Letting 


( fc- 1 )( -i n( 1 -iA)) ’ 

we show that F„ t d,k is satisfiable with high probability if d < d* and unsatisfiable with high probability 
if d > d*. We do this with a simple application of the first and second moment methods, boosting the 
probability of satisfiability below d* to 1 — o(l) using the small subgraph conditioning method. 


1 Introduction 

A fc-uniform d-regular instance of Exact Cover, or equivalently a Positive 1-in-A: SAT formula, has 
n variables and m clauses where dn = km. We can treat it as a bipartite multigraph, with n variables 
of degree d on one side connected to m clauses of degree k on the other. A satisfying assignmnent is a 
subset T of the variables such that exactly one variable in each clause is true. 

We choose random formulas F„ : d,k according to the configuration model: that is, we make d copies 
of each variable and fc copies of each clause, and choose a uniformly random bipartite matching of the 
resulting dn = km copies with each other. We assume that d,k = 0(1) so that m = 0(n). 

We determine the satisfiability threshold for these formulas. Namely, we prove the following. 

Theorem 1. Let 

iX-T-W) +1 - (1) 

Then for any k > 3 and any integer d, 


lim Pr [F ni d,k is satisfiable] = < 

’ I 1 d < dfc. 

Note that when k is large, dj ss (lnfc) + 1. Note also that d% is never an integer, since then k would be 
a rational power of k — 1. 

An easy application of the first and second moment method gives unsatisfability w.h.p. for d > d£, 
and satisfiability with positive probability for d < d£. We boost the latter to high probability with the 
small subgraph conditioning method mm- 

The fact that the second moment method is exact suggests that, at least in the d-regular case, 
this problem does not have a condensation transition. In contrast, for Graph Coloring, NAE-fc-SAT 
and fc-SAT, at a certain density condensation occurs mm-- the set of satisfying assignments becomes 
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dominated by a single cluster, and the number of satisfying assignments becomes much less concentrated. 
Thus while the second moment method gives fairly good bounds for these problems @00121, pushing it 
beyond this point requires much more sophisticated methods that count clusters of solutions, and further 
reduce the variance by carefully conditioning on the distribution of neighborhood structures throughout 
the formula m\mm This line of work recently culminated in a proof of the threshold conjecture for 
fc-SAT for sufficiently large k na, although many open questions still remain. 

Here the situation is much simpler. The only source of variance in the number of satisfying assignments 
is the number of cycles of each length in the formula, so the small subgraph conditioning method reduces 
the variance enough to prove satisfiability with high probability. It also turns out that that the point 
corresponding to two independent satisfying assignments is a local maximum of the rate function for the 
second moment, so there is no need to reweight the assignments as in BOB EH- 

The second moment method also owes its success to the fact that, in the d-regular case, Positive 
1-in-A; SAT is “locked” in the sense that most variables cannot be flipped without also flipping many 
others, so that satisfying assignments are isolated PI ITS] . Given a set S € {1,..., k — 1} let S'-SAT 
be the problem where each clause has k variables, and demands that the number of true variables it 
contains is an element of S. If S does not contain any adjacent pairs i,i + 1, and if every variable has 
degree at least 2, these problems are locked. In m the authors wrote the first and second moments for 
this family of problems, described the resulting bound as a fixed point equation, and conjectured that it 
is exact. This paper proves the case of their conjecture where S = {1}. 

One can also consider random Positive 1-in-A: SAT formulas where clauses appear independently, 
so that the degrees of the variables are Poisson distributed. A lower bound on the threshold in this 
model was given in [18] for k = 3 using differential equations. Other constraint satisfaction problems for 
which the threshold can be computed exactly (and where condensation does not appear to occur) include 
random XOR-SAT [191 l20l [2T1 as well as 1-in-fc SAT [22] where literals are negated with probability 1/2 
as opposed to the positive case we consider here. 

We write /(n) ~ g(n) if lim„_K x, f(n)/g(n) = 1. We say a series of events E n holds with high 
probability if Pr[l?„] ~ 1, and with positive probability if, for some constant B > 0, Pr [E n \ > B for all 
sufficiently large n. 

2 The first and second moments 

In this section we compute the first and second moments, and show that E [Z 2 ] /E [Z] 2 tends to a constant. 
This is enough to show that F n ,d,k is satishable with positive probability for d < *, and we improve this 
to high probability in the following section. 

Lemma 1. If d > then F„ t d,k is unsatisfiable with high probability. 

Proof. Let Z be the number of satisfying assignments T. Since |T| = n/k, the expectation of Z is ( n " fc ) 
times the fraction of matchings, for a given T, that connect each clause to exactly one of the dn/k = m 
copies of variables in T. Applying Stirling’s formula x\ = (1 + o(l))\/2ivx x x e~ x gives 

E [Z]= ( n . ^ rnlk m ((k- l)m)\ = fcm / n \ / f km \ _ e (n-m)h(l/k) = _ (2) 

\ n/k I (km)'. \n/k J \ m I 

where 

<f>i = j- In k — (d — 1) h(l/k) 
k 

In k + (d — l)(k — 1) ln(l — 1/fc) 

k [ ’ 

and 

h(a) = —a In a — (1 — a) ln(l — a) 

denotes the Shannon entropy function. Since (j>\ = 0 when d = dj), and <j >i is a decreasing function of d, 
we have Pr [Z > 0] < E [Z] = whenever d > d%. □ 
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Lemma 2. If fc > 2 and d < d% then F n ,d,k is satisfiable with positive probability. 

Proof. The second moment is the expected number of pairs of assignments T, T' that are both satisfying. 
This depends on the size of their difference. For a given w € [0, 1], let zffl denote the expected number 
of satisfying pairs with | T — T'\ = \T' —T | = wn/k. For a given such pair, (1 — w)m of the clauses must 
be satisfied by a variable in T fl T', and the remaining wm clauses must be satisfied both by a variable 
in T — T' and one in T' — T. The number of such matchings is 

^ ((1 — w)m)\ k ( - 1 ~ w ^ rrL ( wm)\ 2 (fc(fc — 1))™ m (( k — (1 — w) — 2 w)m)\ 

= k m (k — l)™ m m\ ( wm)! ((k — 1 — w)m)\. 


( m 
wm 


Thus 


E 



{ n \ ( n / k \ f (l — l/fc)n\ m\ ( wm)\ ((k — 1 — w)m)\ 
\n/k) \wn/k J l wn/k ) ( km)\ 


= E [Z] (k - l) wrn 



f( 1 - l/k)n\ 
1 wn/k ) 


(k — l)m 
wm 


(4) 


For 0 < w < 1, applying Stirling’s formula to ((4j) gives 

L - 1 v27 rn 

where 

/H=d vW^" (5) 

and 

Mw) = + X ln ( fe - !) + -j: h (w) ~ (d- 1) ^1 - ^ 

As in jB], we can approximate the second moment by an integral, which we evaluate asymptotically 
using Laplace’s method. If <j> 2 (w) has a unique maximum w ma x £ [0,1] where 0 < w max < 1 and 
02 (kWx) < o, then 


E [Z 2 ] = E \ Z ’ 

w=0,k/n,2k/n,... 

1 n 


( 2 ) 


y/2im k 


I' dwf(w)e nMw) 
Jo 


k \J-<t>'i{Wn 

In particular, suppose w max = 1 — 1 /k. We have 


_ / (rn ma x) ^n<l>2 (wmax) 


(7) 


</> 2 (l — 1/fc) = 20! , 

which corresponds to the fact that 1 — 1/fc is the typical value of w if the two sets T,T' are chosen 
independently. Thus if <j >2 is maximized at 1 — 1/fc, and if < 0 there, we have E[Z 2 ] ~ CE[Z] 2 for 
some constant C. 

The following lemma shows that this is in fact the case whenever d < d/.. 

Lemma 3. Let fc > 2 and d < d/,. Then ui max = 1 — 1/fc is the unique maximum of 02 (w) in the unit 
interval, and ^'(wnuuc) < 0. 
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Proof. By direct calculation we have 0^(1 — 1/A;) = 0 and 


02 ( 1-1 /k) 


k{k - d) 

(.k - l ) 2 ’ 


which is negative since dj) < k for all k > 2. Thus 1 — 1/A: is a local maximum. To show that it is unique, 
note that 02 has a unique inflection point w$ where 02 = 0 , namely 


(d — 2) (A; — 
dk — d — k 


This implies that 1 — 1/A: is the only local maximum. Thus we just have to eliminate the possibility that 
the maximum of 0 2 in the unit interval is at w = 0 or w = 1. But this is easy: since d < df. we have 
0i > 0, so 02(0) = 0i < 20i = 02(1 — 1/A:), and as w —» 1. At the other end of the interval, as w —> 1 
we have 02 (w) —1 —00 due to the h(w) term in @. □ 

Plugging Lemma [3] into the Laplace method 0 gives 


£ [z 2 ] ~ • 

and combining this with © gives 

E[^ 2 ] [k^T 

E [Z] 2 \ k-d 

It follows that Pr[Z > 0] > 1/C, completing the proof. 


( 8 ) 

□ 


3 Small subgraph conditioning 

When there are strong correlations between the events that a pair of assignments are both satisfying, 
the variance E[Z 2 ] — E[Z] 2 is a constant times E[Z] 2 , and the second moment method can only prove 
satisfiability with positive probability. However, in some cases we can show that the variance is much 
smaller if we condition on the number of small subgraphs in the formula—in particular, the number of 
cycles of each constant length. This technique was introduced in [T] , where it was used to show that 
random 3-regular graphs possess a Hamiltonian cycle with high probability; another application 25] 
showed that random 5-regular graphs are 3-colorable with high probability. 

Let Xi be the number of cycles of length 2 i in the formula, i.e., cycles alternating between i distinct 
variables and i distinct clauses. Our goal is to compute the correlation between Z and Xi and its higher 
moments, and hence to learn to what extent Xi affects the number of satisfying assignments. Our goal 
is to explain almost all of the variance in Z with the variance in the Xi. 

Let (x) r denote the falling factorial x(x — l)(a: — 2) • • • (x — r + 1); thus (Xi) r is the number of ordered 
lists of r cycles of length 2 i. If X is Poisson with mean A, we have E[(X) r ] = A r . We use the following 
“plug and play” version of the subgraph conditioning method from [2]. 

Theorem 2. Let Z and Xi,X 2 ,... be nonnegative integer-valued random variables. Suppose that 
E [Z] > 0, and that for each i > 0 there are constants Xi > 0, Si > —1 such that 

1. For any j, the variables A'i,... ,Xj are asymptotically independent and Poisson distributed, with 
E[Xi]~A i; 

2. For any sequence mi,..., rrij of nonnegative integers, 

eIzuLA^U] > 

-Epfj-ILC* where fU = Xi(l + 6i), (9) 

3. is finite, and 

(£>,?). ( 10 ) 

Then Pr [Z > 0] = 1 — o(l). 
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Applying this technology to prove the following theorem, and thus complete the proof of Theorem [l] 
is an enjoyable exercise in combinatorics. 

Theorem 3. If k > 2 and d < d% then F ni d,k is satisfiable with high probability. 

Proof. Standard arguments for sparse random graphs [24) show that the X,, are asymptotically indepen¬ 
dent and Poisson distributed. To compute the asymptotic expectation X, note that there are ( m)i(n)i 
sequences of clauses and variables that C could visit; since there are i variables where we could start a cy¬ 
cle and two directions in which we could go, this overcounts by a factor of 2 i. There are (k(k—l)d(d— 1))* 
choices of copies with which to wire each variable to the clause before and after it in the sequence, and 
the number of matchings that include a given such wiring is (km — 2i)!. Thus 

E[X t ] = ^(m)i(n)i(fe(fe — l)d(d — 1))' ~ ^ ~ ^ ~ ^ ^ ■ (11) 

In order to establish ©, we first warm up by computing E[ZA'i]. This is the sum over all pairs (T, C), 
where T is an assignment and C is a cycle of length 2 i, of the fraction of matchings containing C for 
which T is satisfying. 

We start by choosing one of the (Y}k) possible satisfying assignmments T. We then choose C. First, 
we choose t = \C fl T|, the number of true variables in C. Let us think of C as a cycle of i variables, 
where the edges between them correspond to their shared clauses. Since each clause must contain exactly 
one true variable, none of C”s true variables can be adjacent; in particular, t < L*/2J. (This is similar 
to JTJ, where no two adjacent edges of C can belong to a Hamiltonian cycle.) Let W,t be the number 
of ordered, labeled cycles with t true variables, where no two true variables are adjacent; for instance, 
N 6 ,o = 1, Ng,i = 6, Ng ,2 = 9, and Ng } 3 = 2. 

Now that we have chosen t, and chosen one of the W,t arrangements of true variables in it, we choose 
what variables and clauses C contains and how they are matched to each other. There are ( m)i ordered 
sets of i clauses, and ( n/k)t ((1 — l/k)n)._ t choices of which true and false variables appear in C and in 
what order. As before, there are (k(k — 1 )d(d — 1))* ways to wire each variable to the clause before and 
after it, and all this overcounts by a factor of 2 i. 

At this point in the process, we have already satisfied 2 1 clauses in C, so there are m — 2t clauses 
waiting to be satisfied. Happily, we have dn/k — 2t = m — 2t unmatched copies of true variables with 
which to satisfy them. The m — i clauses outside C have k unmatched copies each, and the i — 2t clauses 
in C that are not yet satisfied each have k — 2 unmatched copies. Thus there are (m — 2t)\ orders in 
which we can assign copies of true variables to clauses, and k m ~ 1 (k — 2) l ~ 2t ways to match them with 
these clauses’ copies. After all this, there are (k — 1 )m — 2 (i — t) unmatched copies of false variables, 
which can be matched with the remaining clause copies arbitrarily. Finally, we divide by ( km)\ to obtain 


E [ZXi] = 


n/k 


P /2J r „ 

(rrx)i (rz./fc)t ((1 — 1/A;)ri) i t (fe(A; — l)rf(rf — l)) 1 

t—0 1 

(to - 2t)! fe m -*(fe - 2) i-2t ((fc - l)m - 2 (i - t))\ 

X (km)\ 


Dividing by E [Z] and using (m)i ~ to*, to!/(to — 2t)! ~ m 2t and so on gives 

W 2J 

' 2 i 


- 1 / fc ) n ) i -t( fc ( fc - 1 ) d ( d - 1 )) 1 


(to - 2£)! k^-^k - 2y~ 2t ((k - 1 )m - 2(j - t))\ 
m\ k m ((k — l)m)! 


((k-2)(d-l)Y L ^ J 

— 2^ Ni * 


2 i 


k- 1 


— fix — Aj(l + Si). 
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where 


Si = 


fc - 2 
k- 1 


i U/2J 


Em 


fc-1 

(fc - 2) s 


- 1. 


We can evaluate this sum with the generating function 

P/2J 


g (z) = Ni, t z * = tr (^- Yj 


giving 


St = 


k -2 
fc- 1 


i + yi + 4zY / 1 - vi+ 4« 


fc -1 


- 1 = - 


fc-1 


( 12 ) 


Xk-2)\ 

Generalizing this calculation to show that © holds is a matter of bookkeeping. Let t = X)s=i mi > 
and let ii,..., be a sorted list where each s appears m s times. Then E [z rii=i J is the expected 

number of tuples (T, Ci,..., Ci) where T is a satisfying assignment and each C a is a cycle of length 2 i s . 
Counting as before gives 


E 


[z nti L g J L g J 


E [Z] 


n 


2 i s 


t 1= 0 t 2 = 0 t£=0 

X ( m )£ s iA n / k )T, B ts ((! - l/k)n)^ AU _ ta) (k(k- l)d(d- 1 )) E ' S * S ] 
{m-2J2 s *.)! fc m (fc - ((fc - l)m - 2 EM - t„))\ 

to! fc m ((fc — 1 )to)! 


A ((fc — 2)(d — l))* 3 L Y 2J Ar 

II- oT - E 


S=1 

* 


fc - 1 

(M 




Finally, we establish ca- Using the Taylor series — log(l — z) = EE z */* gives 

fc- 1 


E«EeHE) -iv. 


fc — d 


and comparing with © shows that this is indeed the logarithm of the asymptotic ratio C ~ E [Z 2 ] /E[Z] 2 . 
This completes the proof. □ 
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